Safe multi-agent reinforcement learning for multi-robot control
نویسندگان
چکیده
A challenging problem in robotics is how to control multiple robots cooperatively and safely real-world applications. Yet, developing multi-robot methods from the perspective of safe multi-agent reinforcement learning (MARL) has merely been studied. To fill this gap, study, we investigate MARL for on cooperative tasks, which each individual robot not only meet its own safety constraints while maximising their reward, but also consider those others guarantee team behaviours. Firstly, formulate as a constrained Markov game employ policy optimisation solve it theoretically. The proposed algorithm guarantees monotonic improvement reward satisfaction at every iteration. Secondly, approximations theoretical solution, propose two gradient methods: Multi-Agent Constrained Policy Optimisation (MACPO) MAPPO-Lagrangian. Thirdly, develop first three benchmarks—Safe MuJoCo (Safe MAMuJoCo), Safe Robosuite MARobosuite) Isaac Gym MAIG) expand toolkit research communities. Finally, experimental results benchmarks indicate that our can achieve state-of-the-art performance balance between improving satisfying compared with strong baselines. Demos code are available link (https://sites.google.com/view/aij-safe-marl/).2
منابع مشابه
Dynamic Safe Interruptibility for Decentralized Multi-Agent Reinforcement Learning
In reinforcement learning, agents learn by performing actions and observing their 1 outcomes. Sometimes, it is desirable for a human operator to interrupt an agent 2 in order to prevent dangerous situations from happening. Yet, as part of their 3 learning process, agents may link these interruptions, that impact their reward, to 4 specific states and deliberately avoid them. The situation is pa...
متن کاملSafe, Multi-Agent, Reinforcement Learning for Autonomous Driving
Autonomous driving is a multi-agent setting where the host vehicle must apply sophisticated negotiation skills with other road users when overtaking, giving way, merging, taking left and right turns and while pushing ahead in unstructured urban roadways. Since there are many possible scenarios, manually tackling all possible cases will likely yield a too simplistic policy. Moreover, one must ba...
متن کاملMulti-Agent Reinforcement Learning
This thesis presents a novel approach to provide adaptive mechanisms to detect and categorise Flooding-Base DoS (FBDoS) and Flooding-Base DDoS (FBDDoS) attacks. These attacks are generally based on a flood of packets with the intention of overfilling key resources of the target, and today the attacks have the capability to disrupt networks of almost any size. To address this problem we propose ...
متن کاملMulti-Agent Deep Reinforcement Learning
This work introduces a novel approach for solving reinforcement learning problems in multi-agent settings. We propose a state reformulation of multi-agent problems in R that allows the system state to be represented in an image-like fashion. We then apply deep reinforcement learning techniques with a convolution neural network as the Q-value function approximator to learn distributed multi-agen...
متن کاملMulti-agent Relational Reinforcement Learning
In this paper we study Relational Reinforcement Learning in a multi-agent setting. There is growing evidence in the Reinforcement Learning research community that a relational representation of the state space has many benefits over a propositional one. Complex tasks as planning or information retrieval on the web can be represented more naturally in relational form. Yet, this relational struct...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Artificial Intelligence
سال: 2023
ISSN: ['2633-1403']
DOI: https://doi.org/10.1016/j.artint.2023.103905